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Abstract — We study the stopping times of gossip algorithms for 
networlf coding. We analyze algebraic gossip (i.e., random linear 
coding) and consider three gossip algorithms for information 
spreading: Pull, Push, and Exchange. The stopping time of 
algebraic gossip is known to be linear for the complete graph, but 
the question of determining a tight upper bound or lower bounds 
for general graphs is still open. We take a major step in solving 
this question, and prove that algebraic gossip on any graph of 
size n is 0{An) where A is the maximum degree of the graph. 
This leads to a tight bound of 0(n) for bounded degree graphs 
and an upper bound of 0{n^) for general graphs. We show that 
the latter bound is tight by providing an example of a graph 
with a stopping time of Q{n^). Our proofs use a novel method 
that relies on Jackson's queuing theorem to analyze the stopping 
time of network coding; this technique is likely to become useful 
for future research. 

I. Introduction 

Randomized gossip-based protocols are attractive due to 
their locality, simplicity, and structure-free nature, and have 
been offered in the literature for various tasks, such as ensuring 
database consistency and computing aggregate information 
lfT2l . lfT3l . |l2J. Consider the case of a connected network with 
n nodes, each holding a value it would like to share with 
the rest of the network. Motivated by wireless networks and 
limited resource sensor motes, in recent years researchers have 
studied the use of randomized gossip algorithms together with 
network coding for this multicast task [T6l, (T4\. The use of 
network coding protocols for multicast has received growing 
attention due to the ability of such protocols to significantly 
increase network capacity. 

In this work we consider algebraic gossip, a gossip-based 
network coding protocol known as random linear coding ifTol . 
In the discussion on gossip-based protocols we distinguish 
between the gossip algorithm and the gossip protocol. A 
gossip algorithm is a communication scheme in which at every 
timeslot, a random node chooses a random neighbor to com- 
municate with. We consider three known gossip algorithms: 
PUSH: a message is sent to the neighbor, PULL: a message 
is sent from the chosen neighbor, and EXCHANGE: the two 
nodes exchange messages. The gossip protocol, on the other 
hand, determines the content of messages sent. In algebraic 
gossip protocol, the content of the messages is a random 
linear combination of all messages stored by the sending 
node. Once a node has received enough independent messages 
(independent linear equations) it can solve the system of linear 
equations and discover all the initial values of all other nodes. 

It has been proved lfT4l that network coding can improve 
throughput of the network by better sharing of the network 
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resources. Let us look at the following example f7l. Consider 
a Butterfly Network (see Figure [T} with sources Si and 5*2, 
each is wishing to multicast to both Ri and i?2. All links have 
capacity 1. Without network coding, the maximum achievable 
source-destination rate (assuming both rates are equal) is 1.5, 
due to a bottleneck at node C. Using a simple linear network 
coding, we can "xor" the information coming from Si and 
52 at the node C. By doing so, each receiver will obtain two 
linear equations: i?i will get: xi and xi + X2, and i?2 will 
get: X2 and xi + X2- Now, each receiver is able to solve this 
simple linear system and discover xi and X2- It is clear that 
the resulting throughput is now 2. 

Here we study the performance of algebraic gossip on ar- 
bitrary network topologies, where information is disseminated 
from all nodes in the network to all nodes, i.e., all-to-all dis- 
semination. Previously, algebraic gossip was considered with 
PUSH and PULL gossip algorithms, here we also study the use 
of EXCHANGE, which can lead to significant improvements for 
certain topologies (as we show). Our main goal is to find tight 
bounds for the stopping time of the algebraic gossip protocol, 
both in expectation and with high probability (w.h.p.), i.e., 
with probability of at least 1 — ^. 

The stopping time question, i.e., bounding the number of 
rounds until protocol completeness, has been addressed in the 
past. Deb et al. [6| studied algebraic gossip using PULL and 
PUSH on the complete graph and showed a tight bound of 
&{n), a linear stopping time, both in expectation and with 
high probability. Boyd et al. 12, 13 studied the stopping 
time of a gossip protocol for the averaging problem using 
the EXCHANGE algorithm. They gave a bound for symmetric 
networks that is based on the second largest eigenvalue of the 
transition matrix or, equally, the mixing time of a random walk 
on the network, and showed that the mixing time captures 
the behavior of the protocol. Mosk-Aoyama and Shah L18l 
used a similar approach to l|2l, El to analyze algebraic gossip 
on arbitrary networks. They consider symmetric stochastic 
matrices that (may) lead to a non-uniform gossip and gave 
an upper bound for the PULL algorithm that is based on 
a measure of conductance of the network. As the authors 
mentioned, the offered bound is not tight, which indicates that 
the conductance-based measure does not capture the behavior 
of the protocol. 

A recent independent work by Vasudevan and Kudekar ||2TI 
also offered the use of EXCHANGE together with algebraic 
gossip. Moreover, the authors give a uniform strong bound on 
algebraic gossip for arbitrary networks: 0{n\ogn) in expec- 
tation and 0{n\o^ n) with high probability. The worst case 
performance of algebraic gossip was not previously addressed 
in the literature. 
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Fig. 1. (a) - Without network coding, bottleneck at C brings the capacity 
to 1.5. (b) - With network coding, C transmits at each time unit information 
that is helpful to both receivers, thus the capacity is 2. 



Overview of Our Results 

The main contribution of this paper is new tight bounds 
for the stopping time of algebraic gossip on arbitrary graphs 

First, in Theorem [4] we disprove the resuhs of |21| by 
providing a graph for which algebraic gossip takes n{n^) 
rounds. Our main result then, Theorem[T| gives an upper bound 
of 0{An) for the stopping time of algebraic gossip on any 
graph , where A is the maximum degree in the graph. 

Theorem 1. For the asynchronous (synchronous) time model 
and for any graph G of size n with maximum degree A, the 
stopping time of algebraic gossip is 0(An) rounds both in 
expectation and with high probability. 

This result immediately leads to two interesting corollaries. 
In Corollary[T]we state a matching upper bound to Theorem]?]: 
for any graph of size n, since the max degree IS. — n, algebraic 
gossip will stop w.h.p. in 0{n^) rounds. In Corollary [2] we 
conclude a strong tight bound for any constant degree network 
(i.e., A is constant) of 0(n). This improves upon known 
previous upper bounds which, for certain constant degree 
graphs, had an upper bound of O(ri^). Note that the bound 
of 0(An) is not tight for all graphs (e.g., the complete graph) 
and the question of determine the properties of a network that 
capture tightly the stopping time of algebraic gossip is still 
open. 

The second contribution of the paper is the technique we 
use to prove our results. We novelly bound the stopping time 
of algebraic gossip via reduction to network of queues and by 
the stationary state of the network that follows form Jackson 's 
theorem for an open network of queues. The idea of using a 
queuing theory approach for network coding analysis was first 
introduced in |15| but, as opposed to our approach, it did not 
include the aspect of gossip communication model. We believe 
that the type of reduction presented in this work could be used 
for future analysis of gossip protocols. 

Third, we compare three gossip algorithms: PUSH,PULL, 
and EXCHANGE. While traditionally algebraic gossip used 
PULL or PUSH as its gossip algorithms, it was unclear if using 
EXCHANGE can lead to significant improvements. We give a 
surprising affirmative answer to this question and prove that 
using the EXCHANGE algorithm can be unboundedly better 



than using PULL or PUSH. We show that while the time it 
takes to the EXCHANGE algorithm to complete the algebraic 
gossip on the Star graph is linear i.e. Q{n) the time it takes 
the PULL and PUSH algorithms to finish the same task is 
f2(nlogn). On contrary, there are many other graphs such as 
the Complete graph, and all the bounded degree graphs (see 
Section |IV| i that these three gossip algorithms have the same 
asymptotical behavior 

The rest of the paper is organized as follows. In Section 
[n] we present the communication and time models, define 
gossip algorithms, gossip protocols, and formally state the 



gossip stopping problem. In Section III we show that algebraic 
gossip on the Ring graph is linear using Jackson's theorem. In 
Section jlV] we prove our main results: a tight upper bound for 
arbitrary networks and a tight linear bound for graphs with a 
constant maximum degree. Section |V] gives an answer to the 
question: 'Is EXCHANGE can be better than PUSH or PULL?' 
by providing a topology for which EXCHANGE is unboundedly 
faster. We conclude in Section IVll 

II. Preliminaries and Models 

A. Network and Time Model 

We model the communication network by a connected 
undirected gi'aph G — G{V, E), where V = {vi,V2, Vn} is 
the set of vertices and E C V x V is the set of edges. Let 
N{v) C V he a set of neighbors of node v and d^, — \N{v)\ 
its degree, let A = max^, dy be the maximum degree of G. 

The time is assumed to be slotted. We consider n consec- 
utive timeslots as one round. We consider the following time 
models: 

• Asynclironous time model. At every timeslot, a node 
selected independently and uniformly at random takes an 
action and a single pair of nodes communicate^ In this 
model there is no guarantee that a node will be selected 
exactly once in a round; nodes can be selected several 
times or not at all. 

• Synchronous time model. At every round, a random per- 
mutation of the nodes is selected uniformly and timeslots 
in the round are taken by nodes according to their order 
in the permutation. At each timeslot, a single pair of 
nodes communicates. The model guarantees that during n 
consecutive timeslots each node will be selected exactly 
once. 

B. Gossip Algorithms 

A gossip algorithm defines the way information is ex- 
changed or spread in the network. At each timeslot a single 
node takes an information spreading action that is divided 
into two phases: (i) choosing a communication partner and 
(ii) spreading the information. A communication partner 
u e N{v) is chosen by node v ^ V with probability 
Pvu- Throughout this paper we will assume uniform gossip 
algorithms, i.e., 
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^Alternately, this model can be seen as each node has a clock which ticks 
at the times of a rate 1 Poisson process and there is a total n clock ticks per 
round (2). 
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We distinguish three gossip algorithms for information 
spreading between v and u, PUSH, PULL, and EXCHANGE as 
explained in the Introduction. We assume that messages sent 
in timeslot t are received in timeslot t and can be forward or 
processed at timeslot < + 1. 



C. Algebraic Gossip 

A gossip protocol is a task that is being executed using 
gossip algorithms, for example, calculation aggregate func- 
tions, resource discovery, and database consistency. We now 
describe the algebraic gossip protocol for the multicast task: 
disseminating n initial values of the nodes to all n nodes. 

Let ¥q be a field of size q, each node Vi ^ V holds an 
initial value Xi that is represented as a vector in F^. We can 
represent every message as an integer value bounded by M, 
and therefore, r = [log^(Af)]. All transmitted messages have 
a fixed length and represent linear equations over F^. The 
variables of these equations are the initial values Xi E FJ and 
a message contains the coefficients of the variables and the 
result of the equation, therefore the length of each message 
is: r log2 q + n logj q bits. A message is built as a random 
linear combination of all messages stored by the node and 
the coefficients are drawn uniformly at random from F^. 
A received message will be appended to the node's stored 
messages only if it is independent of all linear equations 
(messages) that are already stored by the node and otherwise it 
is ignored. Initially, node Vi has only one linear equation that 
consists of only one variable corresponding to Xi multiplied by 
a coefficient 1 and equal to the value of Xi, i.e., the node knows 
only its initial value. Once a node receives n independent 
equations it is able to decode all the initial values and thus 
completes the task. 

For a node v at timeslot t, let Sy{t) be the subspace 
spanned by the linear equations (or vectors) it stores (i.e., the 
coordinates of each vector are the coefficients of the equation) 
at the beginning of timeslot t. The dimension (or rank) of a 
node is the dimension of its subspace, i.e., dim{Sy{t)) and it 
is equal to the number of independent linear equations stored 
by the node. 

We say that a node w is a helpful node to node u at the 
timeslot t if and only if Si,{t) ff. Su{t), i.e., iff a random linear 
combination constructed by v can be linearly independent with 
all equations (messages) stored by u. We call a message a 
helpful message if it increases the dimension of the node. 
The following lemma, which is a part of Lemma 2.1 in fE\, 
gives a lower bound for the probability of a message sent by 
a helpful node to be a helpful message. 

Lemma 1 (|6|). Suppose that node v is helpful to node u at 
the beginning of the timeslot t. If v transmits a message to u 
at the timeslot t, then: 

Pr {dim{Su{t + 1)) > dim{Su{t))) > 1 ^ |- 
that is, the probability of the message to be helpful is at least 




D. The Gossip Stopping Problem 

Our goal is to compute bounds on time and number of 
messages needed to be sent in the network to complete 
various gossip protocols over various gossip algorithms. For 
this purpose we define the following: 

Definition 1 (Excepted and high probability stopping times). 
Given a graph G, gossip algorithm A, and a gossip protocol 
V, the stopping time T[A, V , G) is a random variable defined 
as the number of timeslots by which all nodes complete the 
task. E\T{A, V , G)] is the expected stopping time and the high 
probability stopping time T is defined as follows: 

f{A,r, G) - mm [t I Pr {T{A, T', G) < t) > 1 - i] 

We can now express our research question formally: 

Definition 2 (Gossip stopping problem). Given a graph G, a 
gossip algorithm A and a gossip protocol V, the gossip stop- 
ping problem is to determine E[T{A,'P, G)] and T{A^ V, G), 
the expected and high probability stopping times. 

In this work we consider A G {PUSH, PULL, EXCHANGE} 
and V = algebraic gossip, so when these parameters and G 
are understood from the context, we denote the expected and 
high probability stopping times as E[T] and T respectively. 
Moreover, we usually measure the stopping time in rounds 
where one round equals n consecutive timeslots. Thus, we 
define the expected number of rounds as E[R] — E[T]/n 
and R = T/n as the number of rounds by which all nodes 
complete the task with high probability. 

For clarity, we present our proofs for the asynchronous time 
model and the EXCHANGE algorithm, we extend the results to 
the synchronous cases and PUSH and PULL in the appendix 
where we also put some of the more technical proofs. 

III. Linear Bound on a Ring via Queuing Theory 

Before proving the main results of Theorem [T] in the next 
section we prove in this section a bound on the specific case of 
a Ring network. This is a simpler case to prove and understand, 
and will be used as a basis for the proof of the general result. 
A Ring of size n is a connected cycle where each node has a 
one left and a one right neighbor. 

Theorem 2. For the asynchronous time model and the Ring 
graph of size n, the stopping time (measured in rounds) 
of algebraic gossip is linear with high probability, and in 
expectation i.e., R = Q{n) and E [R\ = Q{n). 

Proof: The idea of the proof is to reduce the problem 
of network coding on the Ring graph to a simple system of 
queues and use Jackson's theorem for open networks to bound 
the time it takes helpful messages to cross the network. 

To simplify our analysis, we cut the Ring in an arbitrary 
place and get a Path graph (without loss of generality, we 
assume that the leftmost node in the Path is t^i and the 
rightmost node is w„), see Fig.|2](a). It is clear that the stopping 
time of the algebraic gossip protocol will be larger in a Path 
graph than in a Ring graph. Another simplification that we 
will do, for the first part of the proof, is to consider only the 



4 




Fig. 2. Modeling algebraic gossip in a Path as a queuing network, (a) - Initial Path graph, (b) - One real customer at each node, (c) - Queues are filled 
with dummy customers and real customers enter the system from outside. 



messages that travel from left to right (towards w„) (i.e., other 
messages will be ignored, thus increasing the stopping time). 

We define a queuing system by assuming a queue with a 
single server at each node. Customers of our queuing network 
are the helpful messages, i.e., messages that increase the rank 
of a node they arrive at. This means that every customer 
arriving at some node increases its rank by 1, so the queue 
size at a node represents a measure of helpfulness of the node 
to its right-hand neighbor (i.e., the queue size is the number 
of independent linear equations that the node can generate for 
its right-hand neighbor). The service procedure at node u, is a 
transmission of a helpful message (customer) from Vi to Wi+i. 
So, from Lemma [T] the probability that a customer will be 
serviced at node Vi in a given timeslot is: p 
where | • i = ^ is the probability that in the EXCHANGE 
algorithm a message will be sent from Vi to Vi+i at any given 
timeslot. 

Thus, we can consider that a service time in our queuing 
system is geometrically distributed with parameter p. The 
service time is distributed over the set {0,1,2,...}, which 
means that a customer that enters an empty queue at the end 
of the timeslot can be immediately serviced with probability 
p (since it is the beginning of the next timeslot). A customer 
cannot pass more than one node (queue) in a single timeslot 
so, we define the transmission time as one timeslot. I.e., the 
time needed for a customer to pass through k queues is the 
sum of the waiting time in each queue, service time in each 
queue and additional k timeslots for transmission from queue 
to queue. 

The following lemma shows that the service rate can be 
bounded from below by an exponential random variable. The 
proof of this lemma can be found in Appendix [P] 

Lemma 2. Let X be a geometric random variable with 
parameter p and supported on the set {0, 1,2,.. .}, i.e., for 
k G Pr {X ~ k) ~ {l—p)''p, and let Y be an exponential 
random variable with parameter p. Then, for all x € M^.- 

Pt{X <x)>Pt{Y <x) = l~e-P'', (1) 

i.e., a random variable Y ^ Exp{p) stochastically dominates 
the random variable X ^ Geom{p). 

We can now assume that the service time is exponentially 
distributed with parameter jj. = p. This assumption decreases 
the rate of transmission of helpful messages, and therefore 
will only increase the stopping time. The last is true since the 



probability that a customer will be serviced by the time ti in 
a geometrical server is higher than in an exponential server, 
and thus, each customer in a network with geometric servers 
will arrive at i>„ by time t2 with higher probability than in a 
network with exponential servers. The formal justification of 
this step is given later in Lemma |5] which prove this assertion 
for trees and not only for the line. 

To this end, we have converted our network to a standard 
network of queues where the network is open, external arrivals 
to nodes will form a Poisson process, service times are 
exponentially distributed and the queues are first come first 
serve (FCFS). For a queue i let jii denote the service rate and 
\i the total arrival rate. We present now Jackson's theorem for 
open networks; a proof of this theorem can be found in |4|. 

Jackson's Theorem. In an open Jackson network of n queues 
where the utilization pi — — is less than 1 at every queue, the 
equilibrium state probability distribution exists, and for state 
(ki , fc2, . . . , kn) is given by the product of the individual queue 
equilibrium distributions: 7r(fci, fc2, kn) = Ilf^iP^' (1 ^ pi). 

We would like to use Jackson's theorem to conclude that 
there is an equilibrium state for our network of queues 
and that in the equilibrium state the lengths of the queues 
are independent. For Jackson's theorem to hold we need to 
appropriately define the arrival rate to the queues, so we will 
slightly change our queuing network. 

The initial state of our system is that at every queue we 
have one real customer (see Fig. |2] (b)). Now we take all the 
n real customers out from the system and let them enter back 
via the leftmost queue with a predefined arrival rate. Clearly, 
this modification increases the stopping time. We define the 
real customers' arrivals as a Poisson process with rate A = ^. 
So, Pi = ^ = ^ < 1 for all queues (i S [l..rt]). 

Now, according to Jackson's theorem there exists an equi- 
librium state. So, our last step is to ensure that the lengths 
of all queues at time < = are according to the equilibrium 
state probability distribution. We add dummy customers to all 
the queues according to the stationary distribution. By adding 
additional dummy customers (we call them dummy since their 
arrivals are not counted as a rank increment) to the system, 
we make the real customers wait longer in the queues, thus 
increasing the stopping time. Our queuing network with the 
above modifications is illustrated in Fig.|2](c), where the real 
customers are dark, and the dummy customers are bright. 

We will compute the stopping time in two phases. By the 
end of the first phase, the node u„ will finish the algebraic 



5 



gossip task. By the end of the second phase, all the nodes will 
finish the task. For the first phase, we will find the time it 
takes the n'th (last) real customer to arrive at the rightmost 
node, i.e., node u„. By that time, the rank of node w„ will 
become n and it will finish the algebraic gossip protocol (i.e., 
it received n helpful messages). Let us denote this time (in 
timeslots) as T"" + T^, where T° is the time needed for the 
n'th customer to arrive at the first queue, and is the time 
needed for the n'th customer to pass through all the n queues 
in the system. 

For the second phase, let us assume that after T° + 
timeslots (when finishes the algebraic gossip task) all nodes 
except node u„ forget all the information they have. So, the 
rank of all nodes except u„ is 0. Let us now analyze the 
information flow from the rightmost node in the Path (u„) 
to the leftmost node (vi). In the same way, we will represent 
all helpful messages that node u„ will send as customers in 
our queuing system. In order to use Jackson's Theorem, we 
will again remove all the real customers from the system and 
will inject them to the queue of node w„ with a Poison rate 
A = /i/2. We also fill all the queues in the system with dummy 
customers in order to achieve queue lengths that correspond 
to the equilibrium state distribution. Clearly, arrival of a real 
customer at some node Vi (i 7^ n) will increase the rank of 
that node. So, after the last real customer arrives at node vi, 
the ranks of all nodes will be n, and the algebraic gossip task 
will be finished. 

Using the same equilibrium state analysis as before, we 
define the time it takes the last real customer to arrive at the 
rightmost node Vn as T'^, and the time to cross all the n queues 
- arriving at node vi - as T'*. 

So, T = + T'' + r= + T'^, is an upper bound for the 
number of timeslots needed to complete the task. Now we will 
find upper bound for T^, a; G {a, b, c, d} and then we will use 
union bound to obtain an upper bound on T. 

From Jackson's Theorem, it follows that the number of 
customers in each queue is independent, which implies that 
the random variables that represent the waiting times in each 
queue are independent. To continue with the proof we need 
the following lemmas; the first is a classical result from 
queuing theory, the proof of the second lemma can be found 
in Appendix |E] 

Lemma 3 (|20|, section 4.3). Time needed to cross one 
M/M/l queue in the equilibrium state has an exponential 
distribution with parameter fi ^ X. 

Lemma 4. Let Y be the sum of n independent and identi- 
cally distributed exponential random variables (each one with 
parameter /i > Oj, and E \Y] = ^. Then, for a > 1: 

Pr {Y < aE [Y]) > 1 - (2e-"/2)n^ ^2) 

Recall that: u = » > ^(1 - i) so /i > > ^ for g > 2. 
The random variable is the sum of n independent random 
variables distributed exponentially with parameter /i/2. From 
Lemma [3] we obtain that is the sum of n independent 
random variables distributed exponentially with parameter [i — 
A = ^. It is clear that T"" is distributed exactly as T° and 



is distributed exactly as T^. Therefore (for /i ~ ^): E [T^] — 
S"=i f = 4n^. Using Lemma |4j (with a = 2) we obtain for 

X € {a, 6, c, d}: 

Pr(T"<8n2) >l-(f)". (3) 

Using union bound, we get that; 

Pr (T < 32n2) > Pr (n^r"= < Sn^) (4) 
= 1 - Pr (U^T"^ > %n^) (5) 

>l-4(i)" (6) 

where x e {a, 5, c, d}. It is clear that Pr {T < 32n^) increases 
when fi increases (faster server yields smaller waiting time); 
hence, the above inequality holds for any /i > 5^. 

So, for the asynchronous time model and EXCHANGE we 
obtain an upper bound for the high probability stopping time: 
T = 0{n?) in timeslots, and thus R — 0{n), in rounds. Let 
us now find an upper bound for the expected number of rounds 



needed to complete the task - E [i?] : 

E [R] = iE [T] (7) 

= iE [T" + T** + + T"^] (8) 

= lE [T^] = Hn^ = I6n = 0{n). (9) 



The lower bound is clear since in order to finish the alge- 
braic gossip task each node has to receive at least n messages, 
so at least messages need to be sent and received. Since 
in each timeslot at most 2 messages (using EXCHANGE) are 
sent, we get: f = n{n'^), thus R = n{n), and E [R] = n{n). 

The result of the Theorem|2]is then follows: E[R] — 0(n), 
and R = e(n). ■ 

IV. Algebraic Gossip on Arbitrary Graphs 

Now we are ready to prove our main results. First, we 
present the upper bound for any graph as a function of its 
maximum degree A, and then we give corollaries that are 
applications of this result for more specific cases. 

Theorem 1 (restated). For the asynchronous time model and 
for any graph G of size n with maximum degree A, the 
stopping time of algebraic gossip is O(Ari) rounds both in 
expectation and with high probability. 

Proof: Consider an arbitrary graph G of size n with a 
maximum degree A and a vertex v. First, let us perform a 
(Breath First Search) BFS on G starting at v. The search results 
with a directed shortest path spanning tree Gy, rooted at v 
where each node has a single edge toward it parent in the 
tree (except the root v). In the analysis we will only consider 
messages that sent on the edges of the spanning tree in the 
direction toward v, i.e., we ignore all messages received from 
non tree edges or in the opposite direction (see Fig. |3](b)). 

Now, let us concentrate on the infoimation flow towards the 
node V from all other nodes. As in the proof of Theorem [2] we 
will define a queuing system with a queue at each node (see 
Fig.[3](c)). The following lemma shows that we can model the 
service time at each queue as an exponential random variable 
with parameter ^ = p. 
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(c) (e) 



Fig. 3. Reduction of Algebraic Gossip to a system of queues, (a) - Initial graph G„. (b) - BFS spanning tree r„. (c) - System of queues 
Qn""^ ■ (d) - System of queues Q\^^^^- Stopping time of Q^i^l^^ larger than of Qn"" ■ (e)-Taking all customers out of the system and use 
Jackson theorem for open networks. 



Let G„ be a tree of size n rooted by node v. Let M{Gy,S) 
be a network of n queues where for each node u in Gy there 
is a queue and the queue output is connected to the input of 
the queue corresponding to the parent of u in G^. In addition, 
each queue is of infinite size and initially has one customer 
in the queue (see Fig. [3]) (c)). The servers of all the queues 
work with a service time distributed as S. Let T{Gv,S) be the 
random time by which all the n customers in Af{Gv,S) arrive 
to the queue of v (we assume v does not serve the customers). 

Lemma 5. For any tree G^ and < p < 1: 

PT{T{Gy,Geom{p)) <t)> Pt {T{Gy, Exp{p)) < t) . 

The result of this lemma is that any probabilistic upper 
bound on the stopping time of u in a tree network with 
exponential servers holds for the same tree network with 
geometric servers (both with the same parameter p and initially 
one customer at each queue). 

Once all real customers anive at v, it will reach rank n and 
will finish the algebraic gossip task. Now we have to calculate 
the service time parameter p. The degree of each node in G is 
at most A. Each node in Gy ,except v, has a parent. Since we 
virtually remove (i.e., ignore) all edges that do not belong to 
Gy, at each node there is exactly one edge that goes towards 
the root v. Therefore, the probability that a customer will be 
serviced (transmitted towards v) at the end of a given timeslot 
is at least: p > f- • ^) (1 - -), where - • 4- ^ is the 
probability that in the EXCHANGE algorithm a message will 
be sent on the edge that goes towards v during one timeslot, 
and (1 — is the minimal probability that the message is 
helpful (Lemma [TJi. Clearly, p > ^ for g > 2 so we set our 
exponential servers to work with rate /i = 



Theorem 3. Let Ql^'^'^ be a network of n nodes arranged 
in a tree topology, rooted at the node v. Each node has an 
infinite queue, and a single exponential server with parameter 
II. Initially, there is a single customer in every queue. The time 
by which all the n customers leave the network via the root 
node V is t{Q^^'^'^) =^ 0{n/ j^i) with high probability. Formally, 
for any a > 1: 

Pr (i(Qr') < aAn/fi) > 1 - 2(2e-"/2)". (10) 

The main idea of the Theorem [3] proof is to show that the 
stopping time of the network Q^'^'^ (i.e., the time by which 
all the customers leave the network) is stochastically [j smaller 
or equal to the stopping time of the systems of ^,nax queues 
arranged in a line topology - Q;""^ ('max is the depth of the 
tree Q^'^'^)- Then, we make the system Q;™*^ stochastically 
slower by moving all the customers out and make them enter 
the system via the farthest queue with the rate A = fi/2. 
Finally, we use Jackson's Theorem for open networks (similar 
to the proof of Theorem [2]l to find the stopping time of 
the system. See Fig. [3] for the illustration. The full proof of 
Theorem [3] can be found in the Appendix [K| 

Using Theorem [3] for the tree Gy and with /i = ^ , we 
obtain the stopping time of the node v: Ty < aAn^A with 
probability of at least 1 - 2(2e""/2)" 

The same analysis holds for any node u G V, i.e., perform- 
ing BFS starting from the node u, defining a queuing system 
on the tree Gy, and finding the stopping time of u, Ty. So, 
we can use a union bound to obtain the stopping time of all 

^For completeness, stochastic dominance is formally defined in Appendix 
|K1] 
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the nodes in G: 



Pr j Pi T„ < a4n2A) ) > 1 - 2n{2e-°''^Y. (11) 



By letting a = 2 we obtain: 



Pr 



fl r„ < A) j > 1 



(12) 



So, we obtained that the stopping time of the algebraic gossip 
in G is O(An^) timeslots with high probability and thus: R = 
0{An). 

The high probability bound of Eq. ( [TT| ) is true for any a > 1 
and therefore strong enough to bound the expectation (see 
proof in the Appendix [Fji and finish the proof of Theorem [T] 

E[T] = 0{An^) and E[R] = 0(An). (13) 

■ 

From Theorem [T| and since the maximum degree is at most 
n we can derive a general upper bound of algebraic gossip on 
any graph. 

Corollary 1. For the asynchronous time model and any graph 
G of size n, the gossip stopping time of the algebraic gossip 
task is 0{n?) rounds with high probability and in expectation. 

We can use Theorem [T] to obtain a tight linear bound of 
algebraic gossip on graphs with a constant maximum degree. 
We note that previous bounds for this case are not tight, for 
example, for the Ring graph the bound of |18| is O(n^). 

Corollary 2. For the asynchronous time model and any 
bounded-degree graph G of size n with a constant maximum 
degree A, the gossip stopping time of the algebraic gossip 
task is linear rounds with high probability and in expectation. 

We now show that the upper bound 0{An) presented in 
Theorem [T] is tight in the sense that for each 2 < A < n there 
exists a graph for which algebraic gossip takes n{An) rounds. 

Theorem 4. For any 2 < A < n/2 + 1 and for the 

asynchronous time model there exists a graph G of size n with 
maximum degree A for which algebraic gossip takes $7 (An) 
rounds with high probability, in particular there is a graph for 
which the stopping time is r2(?i^) rounds with high probability. 

Proof We present here the proof for the case of an even 
n and A = n/2 + 1 which can be easily extended to smaller 
A. In order to prove this result we will need the following 
lemma, the proof can be found in Appendix [G] 

Lemma 6. Let X be a sum of m independent and identically 
distributed geometric random variables with parameter p, i.e., 
X = -^i- Then, for any positive integer k < m/p 



Pr(X > fc) > 1 



' kp 



(14) 



Let n be even and consider the Barbell graph which is build 
from two disjoint cliques of size n/2 connected by a single 
edge between u and v (see Figure]?] (a)). We will concentrate 
on the information flow from u to v. It is clear that to achieve 



a rank n node u has to receive ^ helpful messages from v. 
Using the fastest gossip variation - EXCHANGE the probability 
p that a helpful message will be sent in one timeslot from v 



to u is: 



-To > P > - • 49(1 - -)■ Hence: p < 



The 



n n/2 — ^ — n n/2 ^ 

number of timeslots, T, needed to the node v to send n/2 
helpful messages can be viewed as a sum of n/2 geometric 
random variables with parameter p. Clearly, E [T] = ^ • - > 
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timeslots. Using Lemma |6] with k = [n'^/lGj, P = :^ 



I „3/ig I > (we took p even larger than its maximum value; 
this will make calculations nicer and will not affect the bound), 
and m = n/2 we get: 



Pr(T > [nVl6j) > 1 



'kp 



(15) 



= l-(yi/2)". (16) 

It is clear that Pr (T > k) increases when p decreases (the 
smaller probability of success - the larger the probability to 
finish later). Hence, the above inequality holds for any p < 

Thus, the number of timeslots needed is at least [n^/16j 
w.h.p. and n'^/8 in expectation. So, the total stopping time of 
the algebraic gossip protocol over the Barbell graph (measured 
in rounds) is: R = fl{n'^), and E [R] = f7(n^). ■ 

V. EXCHANGE CAN BE UNBOUNDEDLY FASTER THAN 

PUSH OR PULL 

As we presented earlier, there are three gossip variations: 
PUSH, PULL, and EXCHANGE. In PUSH or PULL there is 
only one message sent between the communication partners, 
in EXCHANGE there are two messages sent. Thus, the total 
message complexity for the same number of communication 
rounds is doubled. So, we would like to know: is the stopping 
time decrease, when using EXCHANGE, worth the doubling 
message complexity? In this section we give the answer by 
presenting a graph for which the EXCHANGE gossip algorithm 
is unboundedly faster than the PUSH or PULL algorithms. 

Theorem 5. For the Star graph Sn (which is a tree of n 
nodes with one node having degree n — 1 and the other 
n — 1 nodes having degree 1), for the synchronous time 
model, algebraic gossip using EXCHANGE is unboundedly 
better than using PUSH or PULL algorithms. Formally, for 

A e {PUSH, pull}: 



RjA) 
:XCHAK 
E[R{A)] 



lim — '"—^ > 00 , and 

R{exchange) 



lim , , 

n-i-oo E [R(exchange)] 



00. 



(17) 



(18) 



Proof: 

The proof of this theorem is a direct consequence of the 
following lemmas. 

Lemma 7. For the Star graph Sn, for the synchronous time 
model, algebraic gossip using PUSH takes r2(n^) rounds with 
probability 1. 

Proof: We are interested in a lower bound, so we will 
consider the minimum number of round to complete the task. 
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Fig. 4. (a) - Barbell graph: two cliques of size ^ connected with a single edge, (b) - Extended barbell graph: additional node between the cliques. 



The center node could finish the algorithm after one round 
since all other nodes will send (PUSH) to it their messages 
and in the best case all these messages will be helpful, so we 
ignore this phase. Now, the center node should send (PUSH) 
to every other node n — 1 independent linear equations. In the 
synchronous time model, the center node wakes up exactly 
once in a round. Thus, the number of rounds needed to PUSH 
n ~ 1 messages to all the n — 1 other nodes is at least (n — 
1) • (n — 1) with probability 1. ■ 

Lemma 8. For the Star graph Sn, for the synchronous time 
model, algebraic gossip using PULL takes f2(nlogn) rounds 
with high probability. 

Proof: First, we give the following claim for the coupon 
collector problem lil7J . The proof of the claim can be found 
in Appendix [H] 

Claim 1. Let X be the r.v. for the number of coupons needed 
to obtain n distinct coupons, then: 

E[X] = 6(nlogn) and w.h.p. X ~ 9(rilogn). 

The center node will finish the algorithm once it receives 
(PULL) a helpful message from every other node. Thus, the 
center node has to reach (PULL) every other node at least 
once. In the synchronous time model, the center node will 
transmit once in n consecutive timeslots (or once in one 
round). Reaching every other node at least one time is exactly 
the coupon collector problem (or balls into bins), so (using 
Claimllll: R = il(?ilogn), and E [R] = ri{n\ogn) rounds. ■ 

Lemma 9. For the Star graph Sn, for the synchronous time 
model, algebraic gossip using EXCHANGE takes 0(ri) rounds 
with high probability and in expectation. 



claim 



Proof: To prove Lemma |9] we will use the following 

n 



Claim 2. Let Xi be independent geometric random variables 
with parameter p, and let X — X]"=i -^i- P ^ h 
a > 1: 



Pr {X > 2na) < (2 



1.5-a^ 



(19) 



Let us split the task into two phases, the first phase is the 
time (in rounds) Ri until the center node vi learns all the 
initial messages, i.e., dim{Sy^{t)) = n. The second phase is 
the time (in rounds) R2 it takes vi to distribute the information 
to all the nodes. 



Initially, every node u E V \ {vi} is helpful to vi. By 
Lemma [T] a message sent from u to vi will be helpful with 
probability of at least 1 ^ ^, thus, after n rounds, a node u 
will send a helpful message to vi with probability of at least 
1 — (i) (for q > 2). Using union bound we can find the 
probability that all the nodes u G V\{vi} will send a helpful 
message to vi after n rounds: 



Pr(i?i >n) < 



E 

uev\{vi} 



< n 



(20) 



Now, from the beginning of phase two, and since 
dim{Sv^) — n, node vi will be helpful to every other node 
until the rank of that node becomes n. From Lemma [T] a 
message transmitted to some node from a node helpful to it, 
will increase its dimension with probability p > I — 

Let us define X^ as the number of rounds needed for vi 
to increase the rank of some node u E V\{vi}. It is clear 
that X^ has a geometric distribution with parameter p. We 
are interested to find = X]"=i ^j"' which represents the 
number of rounds by which the rank of node u will become 
n. Using Claim |2] (and the fact that for g > 2, p = 1 - ^ > i), 
we obtain for any a > 1 that: 



Pr(X" < 2an) > 1 - (2i-5-")" 



(21) 



Using union bound, we obtain the probabihty that ranks of all 
nodes will become n after 2an rounds: 



Pr (U„ev\{,,}X" > 2an) < ^ ^an) 

uev\{vi} 



<n(2^-^-°') 



and thus: 



Pr (n„en{„,}X" < 2an) > 1 - n (2i-^-")" 



So, 



Pr(i?2 < 2an) > 1 - n (2i-5^")" , 



and for a = 2: 



Pr(i?2 < An) > 1 - n 



V2 



(22) 
(23) 



(24) 



(25) 



(26) 
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Combining the two phases together, i.e., R < Ri+ R2, we 
have: 



Pr(i? > 5n) < Pr(i?i > n) + Pr(i?2 > 4n) 

/ 1 " 



< n 



< 2n 



V2 



1 

7i 



(27) 
(28) 

(29) 



and thus: R = 0{n). 

Let us now find an upper bound for the expected number of 
rounds needed to complete the task, E[R]. Since R < Ri + 
R2, we get: E [i?] < E [Ri] + E [R2]. During the first phase, 
each node u G V \ {vi} will send a helpful message to vi 
with probability of at least i. Thus, E < 2n. The high 
probability bound of ( |25] l allow us to show that for sufficient 
large n: 

E[R2] < 471 + 1 (30) 

The proof of Eq. ([30]) can be found in Appendix |j] Hence, we 
obtain: E [R] = 0(n). ■ 
Since for A € {PUSH, PULL}: R{A) = fi(nlogn) and 
E [i?(„4)] = fl{n\ogn) (Lemmas [t] and [8|, and from Lemma 
[9] ^(exchange) = 0{n) and E [it!{EXCHANGE)] = 0{n), 
we are ready to conclude that: 



RiA) 



lim 

n^oo i?(EXCHANGE) 



00 , and 



lim 



E[RiA)] 



E [i?(EXCHANGE)] 



VI. Conclusions 

In this work we prove tight bounds on the stopping time 
of the algebraic gossip protocol. We prove that the upper 
bound for any graph is O(n^) and we show that this bound is 
tight, i.e., there exists a graph for which the stopping time 
of algebraic gossip is ri(rt^). Our general upper bound is 
provided as a function of the maximum degree A of a graph 
and thus we can obtain a tight linear bound of 0(n) for any 
graph with a constant maximum degree. Moreover our results 
hold for q > 2 (coefficients field size) while prevoius results 
were for the case q > n. 

It is still an open question to determine the properties of a 
network that capture the stopping time of algebraic gossip. 
To illustrate this, note the interesting observation that on 
the extended-Barbell graph (Fig. |4] (b)) the stopping time of 
algebraic gossip is linear So, by adding a single node to the 
graph of Fig. |4] (a) the stopping time has been changed by an 
order of magnitude?! 

A recent work of Haeupler |9| (to appear in STOC 2011), 
that followed the conference version of this paper 11], answers 
the question we ask here. Haeupler's paper makes a significant 
progress in analyzing the stopping time of algebraic gossip. 
While all previous works on algebraic gossip used the notion 
of helpful message/node to look at the rank evaluation of the 
matrices each node maintains (this approach was initially pro- 
posed by 0), Haeupler used a completely different approach. 



Instead of looking on the growth of the node's subspace 
(spanned by the linear equations it has), he proposed to look at 
the orthogonal complement of the subspace and then analyze 
the process of its disappearing. This elegant and powerful 
approach led to a very impressive result: a tight bound of 
Q{n/j) was proposed for all-to-all communication, where 7 
is a min-cut measure of the related graph. This matches our 
tight bounds for many topologies (e.g., constant maximum 
degree graphs, barbell graph, etc.) and extends them for other 
graphs. Haeupler proposed also results for many-to-all (fc to 
n) communication, but these bounds are not always tight. In 
the future work we aim to find tight bounds for the many- 
to-all communication with uniform algebraic gossip and also 
to analyze the non-uniform gossip approaches for information 
dissemination. 
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Appendix 

A. Algebraic gossip with synchronous time model, with PUSH 
and PULL 

In this section of Appendix we give theorems and corollaries 
that extend the results presented in the paper to both time 
models (synchronous and asynchronous), and to the three 
gossip algorithms (PUSH, pull, and EXCHANGE). 

The first theorem shows that the general upper bound for 
algebraic gossip holds also for the synchronous time model. 

Theorem 6. For the synchronous time model and for any 
graph Gn with maximum degree A, the stopping time of 
algebraic gossip is 0(An) rounds with high probability. 

Proof: The proof for the synchronous time model is 
almost the same as the asynchronous case. The analysis will 
be done in rounds instead of timeslots. The probability that 
a customer will be serviced (transmitted towards v) at the 
end of a given round is at least: p > (l — i^^)^) (1 — 
where (l — (^^)^) — ^ — the probability that in the 

EXCHANGE algorithm at least one message will be sent on a 
specific edge (in a specific direction) during one round, and 
(1 — i) is the minimal probability that the message is helpful 
(Lemma [TJ. 



P>(i-i.)(i-i)>i(i-i) 



for q>2. 



A V-^ qJ — 2A 

If the node i has received a message during a specific round 
from the node j it will ignore the additional message that can 
arrive from the same node j at the same round (this can happen 
if i chooses j and j chooses i in the EXCHANGE gossip scheme 
in one round). Clearly, this assumption can only increase the 
stopping time since we ignore (possibly helpful) information. 

r^, X e {a,b} are measured now in rounds and not in 
timeslots. Since id = p > i^, using Lemma H (with a = 2 
and E [T^] = ^ = 4nA), we obtain: 

Pr (T^ < StiA) > 1 - (f )" , for x e {a, b}. 

The rest of the proof is the same as in the asynchronous 
case and thus the result follows. 

■ 

The following theorem proves that the worst-case lower 
bound for algebraic gossip holds also for the synchronous time 
model. 

Theorem 7. For the synchronous time model there exists a 
graph Gnfor which algebraic gossip takes f2(n^) rounds with 
high probability. 

Proof: The proof is almost the same as in Theorem |4] 
The analysis will be done in rounds instead of timeslots. 

Using the fastest gossip variation - EXCHANGE the proba- 
bility p that a helpful message will be sent in one round from 

V to u is: (1- >p> (1- (^)^)(1- i). Hence: 



> 



P<;i72-(W^^-Clearly,E[i?]^f .1 

Using Lemma 6 (with fc = [n^/lGj, p = n[WJW\ 
m — n/2), and itmeasured in rounds, we get: 



rounds. 



Vl6 



and 



Pr(i?> [nVieJ) > 1- 



{Ve/2y 



The rest of the proof is the same as in Theorem |4] ■ 
The following corollary shows that our bounds (upper and 
lower) for algebraic gossip on general graphs hold also for the 
PUSH and PULL gossip algorithms. 

Corollary 3. The results of Theorems^ |6] and^hold also 
for PUSH and PULL gossip algorithms. 

Proof: By moving from the EXCHANGE to the PUSH or 
PULL gossip algorithms, we change only the probability of 
sending a helpful message on a specific (directed) edge, i.e., 
the service time at each node will change. Easy to see that this 
probability will be decreased by a factor of 2 (i.e., the service 
time will become twice longer). Clearly, such a reduction will 
not affect the asymptotical bounds that were achieved using 
Lemmas |4) and |6] ■ 

B. EXCHANGE is better than PUSH and PULL in the asyn- 
chronous time model 

The next theorem shows that the benefit of EXCHANGE 
over PUSH and PULL in the Star graph, holds also for the 
asynchronous time model. 

Theorem 8. The result of Theorem |5] holds also for the 
asynchronous time model. I.e., for the Star graph Sn, for the 
asynchronous time model, algebraic gossip using EXCHANGE 
is unboundedly better than using PUSH or PULL algorithms. 
Formally, for A £ {PUSH, PULL): 



lim . 

R{exchange) 

lim ^[^(-^)] 



oo , and 



E [R{exchange)] 

Proof: The proof is based on the following lemma: 

Lemma 10. After kn timeslots, in the asynchronous time 
model, the node v & V will take at most l.bk and at least 
0.5k communication actions with high probability. Formally, 
by defining Y"" as a random variable that represents the 
number of communication actions taken by the node v during 
kn timeslots, we have: 

fc/8 



Pr(y" < 0.5k) < - 



Pr(y" > 1.5fc) < 



and 



k/12 



(31) 



(32) 



In addition, by defining as a random variable represent- 
ing the number of timeslot needed to the node v to perform 
m communication actions, we get: 



--^x fc/8 

Pr(To-.5fc>"fc)< ( -j .and 



PTiTl,,<nk)< 



k/12 



(33) 



(34) 



Proof: Let us define an indicator random variable Y^" 
that will indicate whether the node v took the communication 
action at the timeslot i. From the definition of the gossip 
algorithm, Pi{Y^ = 1) = - for all i, and Y^" are independent. 
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Thus, Y^, which is the random variable that represents the 
number of communication actions taken b^ the node v during 
kn timeslots, can be written as: Y" = ■ 

So, is a sum of kn independent Bernoulli random 
variables, and E \Y'"\ — kn^ — k. From the results presented 
in i,l?J (Theorem 4.4 p. 64, and Theorem 4.5 p. 66) the bounds 
on follows directly. 

The bounds on follow from the fact that T,^ is a sum of 
m geometric random variables (each one with parameter 
and the duality between the sums of geometric and Bernoulli 
random variables (we have explained this principle in the proof 
of Lemma [6]). ■ 

Now we will show that the results of Lemmas [7] and |8] hold 
also for the asynchronous time model, i.e., for the Star graph 
Sn, for the asynchronous time model, algebraic gossip using 
PUSH or PULL takes il(nlogn) rounds with high probability 
and in expectation. 

The proof for the asynchronous case is almost the same 
as in the synchronous case. The difference comes in the next 
statements. In the synchronous time model, the center node 
will transmit once in n consecutive timeslots (or once in one 



round). From Lemma 10 in the asynchronous time model, the 



center node will transmit (take a communication action) 1.5A: 

_k_ 

times after at least k rounds with high probability (1— (i) '-^ ), 
and after 1.5k rounds in expectation. Reaching every other 
node at least one time, is exactly the coupon collector problem 
(or balls into bins), which requires (by Claim [T]l il(nlog7i) 
communication actions (both, in expectation and with high 
probability). Thus, the task can not be completed before 
rounds. So, also for the asynchronous time model: 
R — il(nlogn) , and E [R] = il(nlogn) rounds. 

We have to show now that the result of Lemma |9] holds 
also for the asynchronous time model, i.e., for the Star graph 
Sn, for the asynchronous time model, algebraic gossip using 
EXCHANGE takes 0{n) rounds with high probabiUty and in 
expectation. 

During the proof of Theorem [5] we did not assume any 
specific order of sending messages. Thus, the stopping time 
is determined only by the number of communication steps 
performed by every node in the network. The first phase is 
finished (both, in expectation and with high probability) after 
every node u ^ V \ {vi} performs at most n communication 
actions. The second requires from the center node vi to 
perform at most 4rt communication actions. From Lemma 10 
in the asynchronous time model, each 0.5fc communication 
actions are happened after at most k rounds. So, phase 1 
will be finished after at most 2n rounds, and phase 2 will be 
finished after at most 8n rounds. Thus, in the asynchronous 
time model: R = 0{n) , and E [R] = 0{n) rounds. ■ 

C. Proof of Lemma |5] 

Lemma |5] (restated): For any tree G„ and < p < !.• 

Pr(r(G„,Geom(p)) <t)> ¥v {T{G^,Exp{p)) < t) . 

Proof We will prove this claim by showing that for each 
customer c and for each queue u on the unique path that c 



traverse to the root the probability that c reaches u before time 
t is larger in N{G„,G&om[p)) than in A/'(Gt,, Exp(p)). 

Consider a reverse topological order of the nodes in G^, 
w^, w^, . . . , — V, i.e., for every node v\ 1 < i < n, the 
parent of is a node and j > i. For a node let C" be the 
set of customers that it need to serve on their way to the root. 
For a node and a customer c G C" let GUt) denote the event 
that c reached before time t in JV{Gv,Geom{p)) and let 
£l{t) be defined similarly for J\f {G^, Exp {p}). We claim that 
for each 1 < i < n, and each c G G\ Pr(^^(i)) > Pr{£'^{t)) 
and the proof will be by induction on i. 
Induction basis: Fr{gl^{t)) > Pr(fii(i)). By definition 
vi is a leaf with one customer, itself, and no children so 
Pr(t/i,W)=Pr(f,Mi)) = lfort>0. 
Induction step: Assume the claim is true for 1 < i < n — 1 
and we will prove it is true for i + 1. If is a leaf then 
we are done since this is an identical case to the base case. 
Assume is not a leaf. The case c = v^~^^ is trivial so 
consider c G (7*+^ that is not i>*+^. Then c must reach i>*+^ 
via one of its children, let it be v'^ where fc < z + 1. Then by 
the induction assumption Pi-{Q^{t')) > Pr(£'^'(t')) for any t', 
and from Lemma |2] for any t the probability that a customer 
will be served by time t is larger in A/'(G„, Geom(p)) than in 
A/'(Gt,, Exp(p)), so we have a faster arrival rate and a faster 
service rate and the claim follows. ■ 



D. Proof of Lemma [2] 

Lemma|2](restated): Let X be a geometric random variable 
with parameter p and supported on the set {0, 1,2,.. .}, i.e., 
for k G Z+.- Pr {X ^ k) = (1 - pfp, and let Y be an 
exponential random variable with parameter p. Then, for all 
X G M+.- 



Pr (X < > Pr (r < x) = 1 - e 



-px 



(35) 



i.e., a random variable Y ^ Exp{p) stochastically dominates 
the random variable X ^ Geom{p). 

Proof: For a geometric random variable X with a success 
probability p and supported on the set {0,1,2,3,...}: 



Pr {X> x) = {l- pY+\ for xeZ 



and 



Ft{X >x) = (1 -|))L^J+i , for X G M+. 
So, for a; G M+, 

Pr{X <x) = l-(l-p)L^J+i > l-{l-py = I_e'"(i-P)^ 

and since ln(l — p) < —p we have: 

Pr(X < a;) > 1 - e"''"^. 

Hence, if Y ^ Exp(p) we obtain: 

Pt{X <x)>l~ e-P'^ = Pr {Y < x) , 

i.e., random variable Y ^ Exp(p) stochastically dominates the 
random variable X ^ Geom(p). 
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E. Proof of Lemma [5] 

Lemma |4] (restated): Let Y be the sum of n independent 
and identically distributed exponential random variables ( each 
one with parameter /i > 0), and E [Y] = ^. Then, for a > 1: 

Pr (Y < aE [Y]) > 1 - (2e-"/2)n, (36) 
Proof: The generating function of X is given by: 

/■OO 

JQ 

For any s < S: Gx{s) = gzr^- Thus, the generating function 
of Y (sum of independent Xi's) for s < S: Gy(s) — 

(Gx(s))" = (y^) ■ Now, we will apply a Chernoff bound 
on Y. For 6 > s > 0: 

Pr (r > aE [Y]) ^Pr(Y > a^) 



By letting s = (5/2 we get: 



Pr(y > aE[Y]) < 



((5- |)e"5i 



E [e-^] ^ Gy{s) 



and thus: 



Pr {Y < aE [Y]) > 1 - (26^"/^)" . 



F. Proof of the expectation result in Theorem [7] 

Theorem [T] states that the expected stopping time of alge- 
braic gossip on any graph is: E[T] = O(An^) timeslots. 

Proof First, we rewrite the high probability result of Eq. 
[TTlwith a > 1: 

Pr (T > 4an2A) < 2n(2e-"/2)". 

For a positive integer random variable T holds: E [T] = 
E^i > i). So, we have: 



E[T]=Y^ > i) (37) 

i=l 

8n^A-l oo 

= J2 P^iT>i)+ J2 P^iT>i) (38) 

1=1 i=8n2A 
oo 

< Sn^A + Pr(r > i) (39) 

oo 

< 8n2A + WA^Pr(T> 4an2A). (40) 



a=2 



The last inequality is true since Vi < Pr(r > i) > 
Pr{T > j) and thus we can replace all Pr(r > i) for i £ 



[Aan'^A, ...,4:{a + l)n^A-l] with An'^A x Pr(T > Aan^A). 
Hence, 

oo 

E [T] < Sn^A + 4n2A ^ Pr(T > 4an2A) (41) 

a=2 
oo 

< 8n2A + 4n2A^2n(2e-?)" (42) 

a=2 
oo 

= 8n2A + 8n3A2"^(e~"/2)" (43) 

< 

= 8n^A + 8^3 A2" ^ — (44) 



8n^A 



a=2 

1 - e-»/2 
8n3A /2\" 
1 - e-"/2 [ e . 



(45) 



for n > 6 : (46) 
< d>n^A + 8n2A. (47) 

Thus: i;[r] = 0(An2), and E[R\ = 0{An). ■ 

G. Proof of Lemma [6] 

Lemma |6] (restated): Lef X be a sum of m independent 
and identically distributed geometric random variables with 
parameter p, i.e., X — X]i!Li -^i- Then, for any positive integer 
k < m/p 



Pt{X > k)>l 



n-L-kp 

e "I kp 



(48) 



Proof First, we will define Y as the sum of k independent 
Bernoulli random variables, i.e., Y = X^iLi where Yi ~ 
Bernoulli{p). Let us notice that: 

Pr (X < fc) = Pr (r > m) 

The last is true since the event of observing at least m 
successes in a sequence of k Bernoulli trials implies that the 
sum of m independent geometric random variables is no more 
than k. From the other side, if the sum of m independent 
geometric random variables is no more than k it implies that 
m successes occurred not later than the fc-th trial and thus 
Y >m. 

Now we will use a Chernoff bound for the sum of indepen- 
dent Bernoulli random variables presented in |17|: For any 
(5>0and /i = E[r]: 

Since ^i = E\Y] = kp, and by letting 5 = ^^^^^^ we obtain: 



Pr (r > (1 + 5)^i) = Pr (y > m) < -^^^ — 



So: 



Pr {X <k) < 



m — kp 

em kp 



and thus the result follows. 
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H. Proof of Claim [7] 

Claim [T] (restated): Let X be the r.v. for the number of 
coupons needed to obtain n distinct coupons, then: 

E[X] = 6(nlogn) and w.h.p. X = 9(rilogn). 



Proof: The first results and the upper bound w.h.p. are 
well known; see for example lfT9l . ifTTl . We have not found 
a direct reference for the lower bound, namely that w.h.p. 
X — n{n\ogn), so we give an outline here. Let denote 
the event that all n different coupons have been collected 
after X steps. Let X — X^iLi -^i where Xi is an rv. that 
denotes the number of coupons of type i collected. Clearly 
Xj's are dependent. To overcome this difficulty we will use 
Poisson approximation of the binomial random variable Xi 
|[T7|. Consider n Poisson independent random variables Kj 
(i e with mean A = — . Each variable represents 

the number of coupons of type i. Thus, the expected total 
number of coupons collected is X. Let £y denote the Poisson 
version of the event £x, i.e., that after collecting the different 
types of coupons independently with Poisson distribution with 
A, we have at least one type of each coupon. Since F^'s are 
i.i.d., we have Pr (Sy) = (Pr {Y, > 1))". It is clear that both 
Pi {£x) and Pr {£y) are monotonically increasing with X; 
therefore we can use the Poisson approximation that states that 
Pi-(fx) < 2 X Pr(£y) ( (17), Corollary 5.11). Now, assume 
X — n\nn — n\n\nn and we have: 



lim Pr{£y) ^ (l 



^— (In n— In In n) 



\ In n -j^ 



Thus: lim„^oo Pr {£x) <\ and 



lim Py(X > rt In n — 71 In In n) = 1 . 

n— >oo Ji 



I. Proof of Claim [2] 

Claim |2] (restated): Let Xi be independent geometric ran- 
dom variables with parameter p, and let X = X]"=i ^i- 
p> \, and a > 1: 



Pt{X > 2na) < (2i-^-") . 



(49) 



Proof: In order to obtain this upper bound on the sum 
of n independent geometric random variables we will use 
a Chernoff bound. The generating function of a geometric 
random variable Xi is given by: 



G 



pe 



{l-p)e 



^ , where t < — ln(l— p). 



The generating function of the sum of independent random 
variables is a multiplication of their generating functions. 
Thus: 



pe 

(1 -p)eJ 



Now, we will apply Markov's inequality to obtain an upper 
bound on X. For t > 0: 



Pr {X > 2na) = Pr (e*-^ > e""") < 
By letting t = — 0.51n(l — p) we get: 



tx ^ „t2„aN ^ E [e*^] _Gx{t) 



Pt{X > 2na) < 



(1 



l_(l_p)0.5^ 

It is clear that Pr {X > 2na) decreases when p increases. 
Thus, to obtain an upper bound, we will substitute p with 
its minimal value, i.e., 1/2, and we get the result: 

(1 - 0.5)""°-50.5^ " 



Pr {X > 2na) < 



1 - (1 - 0.5) 



0.5 



<(2-0.5 
= (2i-5-")". ' 

■ 

J. Proof of the expectation result in Theorem^ 

The Eq. ^ in Theorem [s] states that: E[R2] < 4n + 1. 
Proof: First, we rewrite the high probability result of Eq. 
( p5| ) for i?2 with a > 1: 

Pr (i?2 > 2na) < n{2^-^-")''. 

For a positive integer random variable i?2 holds: E [R2] = 
Pr(-R2 > i)- So, we have: 

00 

E [R2] = ^ Pr(i?2 > i) 

2 = 1 

4n— 1 00 

= Pr(^2 > + ^ Pr(i?2 > ^) 

i—1 i—An 

00 

<4n+Y^ Pr(i?2 > i) 

00 

< 4n + 2n ^ Pr(i?2 > 2na). 

a=2 

The last inequality is true since Vi < j, Pr(-R2 > «) > 
Pr(i?2 > j) and thus we can replace all Pr(i?2 > i) for 
i e [2na, 2n{a + 1) - 1] with 2n x Pr(i?2 > 2na). Hence, 

00 

E [R2] < 4n + 2n ^ Pr(i?2 > 2na) 

a=2 
00 

< 4n + 2n^n(2i-5-")" 

a=2 

00 

= 4n + 2n2 .2i-5"^(2-")" 

Q = 2 

9 — 2n 
2 nl.bn ^ 



4n + 2n^ • 2 



1 - 2- 



2 o0-5n 



= An 



2n^ ■ 2 



2" - 1 
and for n > 19 : 

< 4?i + 1. 
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K. Proof of Theorem [i] 

For the proof of this theorem we need the following 
definitions and lemmas. 
1} Stochastic Dominance: 

Definition 3 (Stochastic dominance, stochastic ordering ifTTl . 
lISl). We say that a random variable X is stochastically less 
than or equal to a random variable Y if and only ifPT{X < 
t) > Pt{Y < t), and such a relation is denoted as: X ^Y. 

Claim 3. If for i G {1, 2}, ^ Yi, Xi are independent, and 
Yi are independent, then: max^ X^ -< max^ Yi. 

Proof: 

Pr(maxX, < t) = P|Pr(X, < t) = J|Pr(Xi < t) 

i i 

>Y[Pr{Yi <t)= Pr(maxy, < t). 

i 

Hence: 

laaxXj -< maxY;. 



Claim 4. If for i G {l, 2}, Xi ^ Yi, Xi are independent, and 
Yi are independent, then: Xi ^ Yi. 



Proof: 



Pr(Xi +X2<t)= I fx,+x, {s)ds, 
where = fxA^) * ^(s). 



Thus: 



' — OO 

/>oo 



/•OO 

/ fxAr)fxJs^ 

J —OO 


- T)dTds 


fxAr) Pr(X2 <i- 


- T)dT 


fxAr) Pr{Y2<t^ 


T)dT 


/ fxAr)fYAs- 

J — OO 


T)dTds 


/ fYAr)fxAs- 

J —OO 


T)dTds 


fYAr)PT{X,<t- 


T)dT 


fYAr)MYi<t- 


T)dT 


r fYAr)fYAs~ 


T)dTds 



Hence: 



Pr(ri + y2 <t). 



i=l 1=1 



2) Later arrivals yield later departures: Consider an infi- 
nite FCFS queue with a single exponential server. We define 
ai as the time of arrival number i to the queue, and di as 
time of the departure number i from the queue. Let Xi be the 
exponential random variable representing the service time of 
the arrival i. For all i, Xi's are i.i.d. 

Let ai be a sequence of m arrival times to the queue, and 
di be a sequence of m departure times from the queue. 




ai (12 03 a4 05 





















X2 




X3 




Xi 
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di (12 ds di d^ 

di = max{ai, di^i) + Xi 
Fig. 5. AiTival and departure times. 

Lemma 11. If the sequence ai is replaced with another 
sequence of m arrivals - di, such that: di ^ ai^i ^ [1, m], 
then, the resulting sequence of m departures will be such 
that: di >: di Vi G [1,...,to]. I.e., if every new arrival 
occurred , stochastically, at the same time or later than the 
old arrival, then, every new departure from the queue will 
occur , stochastically, at the same time or later than the old 
departure. 

Proof: The proof is by induction on the arrival index j, 
j G [1,...,™]. 

• Induction basis: di h di- Follows since di — ai + Xi, 
di = fli + Xi, and di '(Z 0,1- 

• Induction assumption: Vi < j : di ^ di. 

• Induction step: we need to show that dj ^ dj. 

If the j's arrival occurred when the server was busy, then 



di 



dj-i + Xj. If the server was idle when the j's arrival 



occurred, then dj = aj + Xj. Thus, we can write: 



dj — max((ij_i, Uj) + Xj, 
and dj = max(dj_i, dj) + Xj. 



(50) 
(51) 



Since from induction assumption: dj^i >z rfj-i, and dj >: aj, 
using Claims Hand [4) we obtain dj h dj. ■ 
Proof of Theorem |5| 
We denote the nodes of the queuing system Q^'^'^ as Zj, 
where I (l G [1, ^max]) is the level of the node in the tree, 
and j is the node's index in the level /. The root of the Q^^^"^ 
tree is the node Zl. All servers in the Qli'^'^ network are ON 
all the time (work-conserving scheduling), i.e., servers work 
whenever they have customers to serve. There are no external 
arrivals to the system. Once a customer is serviced on the level 
I, it enters the appropriate queue at the level I — 1. When a 
customer is serviced by the root it leaves the network. 
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Now, let us define the auxiliary queuing systems: Q^^^ ™d 

Definition 4 (Network Qn'^'^)- Qn'^^ is the same network as 
Qtree y^jff^ the fallowing change in the servers' scheduling: 

At any given moment, only one server at every level I (I (z 
[1, Zinax]j is ON. Once a customer leaves level I, a server 
that will be scheduled ( turned ON) at the level I, is the server 
which has in its queue a customer that has earliest arrival time 
to a queue at the level I among all the current customers at the 
level I. If there are customers that initially reside at the level 
I, they will be serviced by the order of their IDs ( we assume 
for analysis that every customer has a unique identification 
number). 

Definition 5 (Network of queues Qj*"*^). Qj*"*^ is the the 
following modification of the network Q^n'^'^, that results in a 
network of Imax queues arranged in a line topology. 

For all I g [1, .., ^max], w merge all the nodes at the level 
I to a single node (a single queue with a single server). We 
name this single node at the level I as the first node in Q^^'^'^ at 
the level I, i.e., Z\. The customers that initially reside at level 
I will be placed in a single queue in tlie order of their IDs. 
This modification results in Qj*'"^ - a network of ly^ax queues 
arranged in a line topology: z}-^'"'-^ — >■ _^ . . . _j. 

Definition 6 (Network of queues Q;™^)- Q/™"^ - is the same 
system as Qj'""^ with the following modification. We take the 
last customer at some node Z™ (m € [l,..,iniax^ l]j ond 
place it at the head of the queue of the node Z™'^^. I.e., we 
move one customer, one queue backward in the line of queues. 

Definition 7 (Network of queues Q;*"^^)- Q\^2L ~ same 
system as Q\"'''^ with the following modification. We move all 
the customers to the queue Z^"""'. I.e., all the customers have 
to traverse now through all the Z^ax queues in the line. 

We summarize the queuing systems defined above in the 
short Table U 



Vn 


Original system of n queues aiTanged in a tree topology. Fig. 
|6](a). 


/\tree. 


System of n queues arranged in a tree topology. Only one 
server is active at each level at a given time. Fig. |6|(b). 




System of /max queues amnged in a line topology. Fig. |6j 
(c). 


Qline 


System of Zniax queues arranged in a line topology. One 
customer is moved one queue backward. 


Qline 


System of /max queues arranged in a line topology. All 
customers are moved backward to the queue zj™"" . 



TABLE I 

Queuing systems used in the proof. 



The proof of Theorem [3] consists of showing the following 
relations between the stopping times of the queuing systems: 



Stopping time of a queuing system t{Q), is the time by 
which the last customer leaves the system (via the node Zl). 
In order to compare the stopping times of queuing systems, 
we define the following ordered set (or sequence) of departure 
time from a server Z in a queuing system Q: d{Z, Q) = 
{di{Z,Q),d2{Z,Q),...,d,{Z,Q),...), where d,{Z,Q) is the 
time of the departure number i from the node (server) Z. 

First, we want to show that the stopping time of Q*^'^'^ is at 
most the stopping time of the system Q^'^'^, i.e., t{Q^^'"^) < 

tiQlD- 

Lemma 12. In Q^^"^*^, every departure from the system ( via 
Z\) will occur, stochastically, at the same time or later than 
in Q^^- 

d,{ZlQnn h d,{ZlQlD Vz e [l,...,n]. (52) 

Thus, in Ql^"^*^, the last customer will leave the system, 
stochastically, at the same time or later than in Qll'^'^, or: 

Proof: The proof is by induction on the tree level / G 

1 , . . . , ^max] • 

.Induction basis: \fiJ:di{Z^'^^^,Q*^^'^) >z 
c/i(Z}"^^Q^'-^'=). This is true since in Ql^"", the 
nodes do not work all the time, and thus the departures 
will occur, stochastically, at the same time or later than 
in Qn"^^- If there is a single node at the level Zmax, in 
Q*^""^ it will be ON all the time as in Qll^\ and thus, 
the departures will occur, stochastically, at the same time 
in both systems. 

• Induction assumption: for all ^ > m (to > 1), 

• Induction step: we need to show that: 

V*,j : d,{Z^,QlD h 
By induction assumption, for I — m + 1: 
V*,j : d,;(Z;"+i,Q*:-) h d,(Zf+\Qj:-). Now let us 
take a look at the departures from a node Zj". There are two 
cases: Z™ is a leaf, and Z™ is not a leaf. If Z™ is a leaf, 
we can use the same argument as in the induction basis: in 
Qn^"^, the node Z™ does not work all the time, and thus the 
departures from it in Q^'^'^ cannot occur earlier than in Qn'^^. 
If Z™ is not a leaf, it has input/inputs of arrivals from the 
level TO + 1. Since the arrivals from the level to + 1 in Ql^^'^ 
occur, stochastically, at the same time or later than in Q^i^'^ 
(by induction assumption), even if the node Z™ would work 
all the time (as in Q^'^'^), we would obtain from Lemma 11 
V«,j : d^iZY',Q'^") h d,{Z^,Ql^^^). Moreover, in Q^'''=^ 
the node Z™ does not work all the time (unless it is the only 
node at the level m), thus the departure times in Q^'^'^ can 
be even larger ■ 

Lemma 13. In Qj*'"^ , every departure from the system ( via 
Z\) will occur, stochastically, at the same time as in Ql^'^^. 
Thus, in Q\^"^ , the last customer will leave the system, 
stochastically, at the same time as in Q*^'^'^. 

Proof: Consider the two following facts regarding the 
network Qll'^'^. First, a customer entering the level I will be 
serviced after all the customers that arrived to the level I 
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before it, are serviced. Second, at any given moment, only 
one customer is being serviced at the level I (if there is at 
least one customer at the nodes Zj). These facts are true due 
to the scheduling of the servers in Q^'^'^ (Definition |4]i. 

Clearly, the same facts are true for the network Q;*"*^ . First, 
any customer entering to the level I will be serviced after all 
the customers that arrived to the level I before it, are serviced. 
Second, at any given moment, only one customer is being 
serviced at the level I (if there is at least one customer in the 
node Z[). These facts are true since in Q'*""^, at every level, 
there is a single queue with a single server (Definition |5]l. 

So, the departure times of every customer from every level I 
(I E [1, /max]) are, stochastically, the same in both systems. 
The departures from level I — 1 are the departures from the 
node Zl, and thus the lemma holds. ■ 

Now we are going to move one customer, one queue 
backward and will show that the resulting system will have 
stochastically larger (or the same) stopping time. 

Lemma 14. Consider a network Q^^'^ . Let m be a level 
index: m G [1, ..,/max ~ !]• take the last customer at the 
node Z'^ and place it at tlie head of the queue of the node 
and call the resulting network - Qj™^ (Fig. [t] (b)). 



Then: 



(53) 



Thus, in Q\^"^ , the last customer will leave the system, 
stochastically, at the same time or later than in Qj™*^, or: 

Proof We call the customer that was moved - customer 
c. Let us take a look at the times of arrivals to the node in 
Qiine ^jj^ Qhne gjjj^-g j-jjg customcr c is already located in 
the queue of in Qj*"*^ , its arrival time can be considered 
as 0. In (5;™*^ , the arrival time of c is at least (it should 
be serviced at Z^^^ before arriving at Z™). Each one of the 
rest customers that should arrive at will arrive in Qj*"*^ , 
stochastically, at the same time or later than in Q'""^ , since 
in Q;™"^ the server Z™^^ should first service the customer 
c, and only then will start servicing the rest customers. 
Thus, gj™";) h d,{Z];'+\Q[^^J. Using Lemmajll] 

uD ^ ^^(^"r^QjLTx)- Iteratively 
to the nodes Z[, I & [m — 1, 1], we 
obtain the result: d~(Zl , ) >z d, [Zl , Qj™'^ ) . ■ 



we obtain that: d,(Zf,Qf"^ 
applying Lemma 
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Corollary 4. Consider a network Q'™^ 
is identical to the network Qj*"'^ 



; (Definition^ which 
with the following change. 
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c 



















d(Z\,Q\l 



(a) - Network Q"^ 









D 





all the n customers 



(b) - Network 



(c) - Network QlJ™ 



d(Z;".Qi™) 



Zl 



Fig. 7. (a) - Network Qj*""^ . (b) - Network Ql['^'^, where one customer is moved one queue backward, (c) - Network Q*'"'^ , where all the customers are 
at the last queue. 



In QY^'^ , all the n customers are located at the node Z^" 



(Fig. [Zffcjj. Then: 



d,{ZlQl::j ^ d,{ZlQl^:j V* e [1, ...,n]. 



(54) 



Thus, in Qi^, the last customer will leave the system, 
stochastically, at the same time or later than in Qj™*^, or: 

Proof: Given the network Qj*"'^ we take one customer 
from the tail of some queue (except the queue of the node 
) and place it at the head of the queue of the preceding 
According to the Lemma 
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we get a 



node in the Qj™'^ 
network in which every customer leaves via stochastically, 
not earlier than in Q;™- Iteratively moving customers (one 
customer and one queue at a time) backwards we get finally 
the network Qf™*^ in which all the n customers are located at 
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the node Zj^'"'"'. Since at each step, according to Lemma 
the departure times from Z| could only get, stochastically, 
larger, the lemma holds. ■ 

Corollary 5. The time it will take the last customer to leave 
the network of n queues arranged in a tree topology is, 
stochastically, the same or smaller than in the network of n 
queues arranged in a line topology where all the n customers 
are located at the farthest queue, i.e., t{Ql^^^) ^ t{Q\'^"^). 

Proof: This corollary is a direct consequence of the 
Lemmas [12] [13] and the Corollary ]4] ■ 
Now we are ready for the last step of the proof. We will 
find the stopping time of a system of queues arranged in a line 
topology and with all the customers located at the last queue. 



Lemma 15. The time it will take to the last customer to leave 



the system Q^"^^ (Imax queues arranged in a line topology) is 



0{n/ ii) with high probability. Formally, for any a > 1: 

Pr (t((3i™;j < ain/f^j > 1 - 2(2e-°/2)". (55) 



Proof: Initially, all the customers (from now we will call 
them real customers) are located in the last (Zj'""'') queue. We 
now take all the real customers out of this queue and will make 
them enter the system (via the Z{"""') from outside. We define 
the real customers' arrivals as a Poisson process with rate 
\ — ^. So, p = ^ = i < 1 for all the queues in the system. 
Clearly, such an assumption only increases the stopping time 
of the system (stopping time is the time until the last customer 
leaves the system). According to Jackson's theorem, which 
proof can be found in |5|, there exists an equilibrium state. So, 
we need to ensure that the lengths of all queues at time t ~ 
are according to the equilibrium state probability distribution. 
We add dummy customers to all the queues according to the 
stationary distribution. By adding additional dummy customers 
to the system, we make the real customers wait longer in the 
queues, thus increasing the stopping time. 

We will compute the stopping time t{Q\™^ ) in two phases: 
Let us denote this time as ti +t2, where ti is the time needed 
for the n'th customer to arrive at the first queue, and t2 is the 
time needed for the n'th customer to pass through all the Z„iax 
queues in the system. 

From Jackson's Theorem, it follows that the number of 
customers in each queue is independent, which implies that 
the random variables that represent the waiting times in each 
queue are independent. 

The random variable ti is the sum of n independent 
random variables distributed exponentially with parameter 
/i/2. From Lemma [3] we obtain that t2 is the sum of Imax 
independent random variables distributed exponentially with 
parameter /i — A = fi/2 (Lemma [s]). Since Imax < n, we 
can assume the worst case (for the upper bound of stopping 
time) Imax = n. Thus, we can view t2 as the sum of n 
independent random variables distributed exponentially with 
parameter /i/2. E [ti] = X^^Li 2//^ = "^n/ii, so, using Lemma 



EI 

Pr {h < aE [h]) > 1 - (26-"/=^)", (56) 
Pr {ti < a2n/fi) > 1 - (2e-"/2)"^ (57^ 

In a similar way we obtain: 

Pr {t2 < a2n/fi) > 1 - (26""/^)". (58) 
l-^Qhne ''j = ti + <2, thus, using union bound: 

Pr {h +t2< aAn/fi) > 1 - 2(26-"/^)". (59) 
and thus: 

t{Q^CD = 0{n/n) (for a constant a) (60) 
w.p. of at least 1 - 2(26""/^)". 

■ 

From Claim|5]we obtain that tiQll"^") ^ t{Q\^l) and thus: 
tiQu"") < a4n//x w.p. of at least 1 - 2(2e~"^2)n foj. ^ny 
a > 1, so the proof of Theorem [3] is completed. ■ 



